TDSGen: An Environment Based on Hybrid Genetic Algorithms for Generation of Test Data

نویسندگان

  • Luciano Petinati Ferreira
  • Silvia Regina Vergilio
چکیده

The software testing has gained importance and can be considered a fundamental activity to ensure the quality of the software being developed. In the literature, there are three groups of testing techniques proposed to reveal a great number of faults with minimal effort and costs: functional technique, structural technique and fault-based technique. These techniques are generally associated to testing criteria. A criterion is a predicate to be satisfied to consider the testing activity ended and the program tested enough [3]. The criterion generally requires the exercising of certain elements of the source code (decision statement) called required elements. To satisfy a testing criterion (coverage of 100%), it is necessary to provide input data to execute paths that exercise all the required elements. This is a very hard task because it is not possible its complete automatization due to several testing limitations. The generation of test data has been addressed by many authors and different techniques, such as: random generation, generation based on symbolic execution and generation based on dynamic execution, have been applied with varying degrees of success, maybe by the testing limitations and the complexity inherent to the test generation task. Therefore some authors proposed the use of metaheuristics algorithms, such as Genetic Algorithm, originating a new research field named Evolutionary Testing [4]. Many works address GA for test generation but most of them do not offer a testing environment for supporting the organization and the complete application of a testing strategy including different testing criteria. We explore the use of GA for test data generation and describe an environment, named TDSGen (Test Data Set Generator), that evolves a population of test inputs to satisfy different testing criteria. The goal is to obtain a population that represents a set that better satisfies the chosen criterion. TDSGen uses the coverage measurement for the fitness evaluation and implements different mechanisms based on the strategies: elitism, sharing and tabu lists. TDSGen integrates two different testing tools, Poketool [1] and Proteum [2], allowing to apply structural and fault-based criteria of C programs. These tools are responsible for the generation of the required elements and for the coverage evaluation, by producing the list of the covered elements. TDSGen presents other advantages: the tester provides a configuration file and initial functional test set, no analysis of the program is necessary, it implements a mechanism to encode the program input allowing the testing of different types of programs

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Appraisal of the evolutionary-based methodologies in generation of artificial earthquake time histories

Through the last three decades different seismological and engineering approaches for the generation of artificial earthquakes have been proposed. Selection of an appropriate method for the generation of applicable artificial earthquake accelerograms (AEAs) has been a challenging subject in the time history analysis of the structures in the case of the absence of sufficient recorded accelerogra...

متن کامل

Optimization of concrete structure mixture plan in marine environment using genetic algorithm

Today due to increasing development and importance of petroleum activities andmarine transport as well as due to the mining of seabed, building activities such as construction of docks, platforms and structures as those in coastal areas and oceans has increased significantly. Concrete strength as one of the most important necessary parameters for designing, depends on many factors such as mixtu...

متن کامل

A multi-objective genetic algorithm (MOGA) for hybrid flow shop scheduling problem with assembly operation

Scheduling for a two-stage production system is one of the most common problems in production management. In this production system, a number of products are produced and each product is assembled from a set of parts. The parts are produced in the first stage that is a fabrication stage and then they are assembled in the second stage that usually is an assembly stage. In this article, the first...

متن کامل

A New Hybrid model of Multi-layer Perceptron Artificial Neural Network and Genetic Algorithms in Web Design Management Based on CMS

The size and complexity of websites have grown significantly during recent years. In line with this growth, the need to maintain most of the resources has been intensified. Content Management Systems (CMSs) are software that was presented in accordance with increased demands of users. With the advent of Content Management Systems, factors such as: domains, predesigned module’s development, grap...

متن کامل

A hybrid model based on machine learning and genetic algorithm for detecting fraud in financial statements

Financial statement fraud has increasingly become a serious problem for business, government, and investors. In fact, this threatens the reliability of capital markets, corporate heads, and even the audit profession. Auditors in particular face their apparent inability to detect large-scale fraud, and there are various ways to identify this problem. In order to identify this problem, the majori...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004